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Abstract. We define a new descriptor for persistent homology, 
which we call the persistence landscape, for the purpose of facili- 
tating statistical inference. This descriptor may be thought of as 
an embedding of the usual descriptors, barcodes and persistence 
diagrams, into a space of functions, which inherits an LP norm. 
We show that the corresponding metric is topologically equivalent 
to the (p + 1)-Wasserstein distance, and that this metric space is 
complete and separable. We prove a stability theorem for persis- 
tence landscapes. For p = 2, we show that the Frechet mean of 
persistence landscapes is the pointwise mean, and that the Frechet 
variance is the integral of the pointwise variances. Furthermore, 
the sample mean of persistence landscapes converges pointwise to 
the mean of the underlying distribution, and there is a correspond- 
ing central limit theorem. 



1.1. Persistent homology. One of the main tools in applied topology 
is persistent homology Let us start with a brief overview of one of the 
ways in which it is calculated. Data from an application is processed so 
that it may be considered to be a finite set of points in Euclidean space, 
or more generally some Riemannian manifold. To this point cloud we 
apply a construction, such as the Cech complex, Vietoris-Rips complex, 
or alpha complex that results in a simplicial complex that depended 
on one parameter. For example, the Cech complex is the nerve of the 
union of balls of fixed radius centered at each of the data points. 

As this parameter increases, our simplicial complex grows. We may 
consider the resulting object, X, as a finite sequence of simplicial com- 
plexes, together with the corresponding inclusion maps, or as a filtered 
simplicial complex. Taking simplicial homology with coefficients in 
some fixed field, we obtain a finite sequence of vector spaces, together 
with linear maps induced by the earlier inclusions. 
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Homology classes which are in the images of these maps are referred 
to as persistent homology classes. Remarkably, there is an efficiently- 
computable complete invariant for ([T]), which is referred to as the per- 
sistent homology of X. 

Persistent homology has two equivalent^ standard descriptors: the 
barcode, consisting of a multiset of intervals, and the persistence dia- 
gram, consisting of a multiset of ordered pairs. Informally, an interval 
starting at a and ending at 6, or an ordered pair (a, b) corresponds to a 
persistent homology class that is born at M a and dies at For more 
details see the articles OHUj, or the books [HIE]. 

1.2. Metrics for persistent homology. Given two descriptors for 
persistent homology, one is immediately led to ask the following ques- 
tion. How close together are they? Because longer intervals correspond 
to homology classes that persist through a wider range of parameters, 
we would like our metric to give greater weight to longer intervals than 
to shorter ones. One suitable and commonly used metric is the Wasser- 
stein distance, W p , that has an U L P version" for each of 1 < p < oc 



(Definition 3.1). 



1.3. Probability and statistics. A good framework for probability 
theory is that of a Polish space. A metric space is a Polish space if and 
only if it is complete and separable. It has been shown [7J that the 
space of persistence diagrams with the Wasserstein distance is indeed 
a Polish space. 

In the setting of a metric space, a good notion of mean and variance 
is given by the Frechet mean and the Frechet variance (Definition Q 



Some nice existence results for Frechet means of persistence diagrams 
were proven in [7J. One of the main motivations for the work in the 
present paper was to provide a setting in which one can calculate the 
Frechet mean and variance for persistent homology. For a recent alter- 
native approach, which gives an algorithm for calculating the Frechet 
mean of a variant of the Wasserstein distance, see [9]. 

1.4. Persistence landscapes. In this paper, we describe a new de- 
scriptor for persistent homology which we call the persistent landscape. 
For k e N and t G R, the value of the persistence landscape of a bar- 
code is the maximum radius (half the length) of an interval centered at 
t that is contained in k of the intervals in the barcode. It has a similar 
definition for a diagram of vector spaces and linear maps, such as ([!]). 



1 In some other scenarios, the barcode can contain more information, as it allows 
one to differentiate between intervals that do or do not contain their endpoints. 
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The main technical advantage of this descriptor is that it is a real- 
valued function and thus the space of persistence landscapes inherits 
an L p norm and corresponding metric. In addition, some of what we 
would like to calculate can be done in a pointwise fashion. 

Furthermore, for each of the L p norms, the persistence landscape 
gives, by definition, greater weight to longer intervals than to shorter 
ones. So it satisfies our main requirement for a metric for persistent 
homology. 

1.5. Outline and summary of main results. We start by defining 
persistence landscapes and giving some basic results in Section [2} 

In the third section we prove that for finite persistence diagrams, 
the metric given by the L p distance between the corresponding persis- 
tence landscapes is topologically equivalent to the (p + 1)-Wasserstein 



distance (Theorem 3.2). 



In Section [4j we show that the persistence landscape of a tame Lip- 
schitz function on a suitably nice space is stable with respect to the 



supremum norm (Theorem 4.1) 



In the fifth section we prove (Theorem 5.3) that the space of per- 



sistence landscapes with the metric inherited from the L p norm is a 



Polish space. We also prove (Theorem 5.15) that for p — 2, the Frechet 
mean of persistence landscapes is just the pointwise mean, and that the 
Frechet variance is the integral of the pointwise variances. From the 
strong law of large numbers, we get that the sample mean of persistence 
landscapes converges pointwise to the Frechet mean of the underlying 



distribution (Corollary 5.17). From the central limit theorem, we get 



a pointwise central limit theorem for the sample mean of persistence 



landscapes (Corollary 5.19) 



We end by applying these results to points sampled from a torus and 
a sphere in Section [6| 

1.6. Notation. Let us fix some notation. Let N = {1, 2, 3, . . .} and let 
K = IRU {— oc, oc}. We denote the domain and codomain of a function 
/, by dom(/) and cod(/), respectively. 

2. Persistence Landscapes 

2.1. Persistence landscapes from persistence diagrams and bar- 
codes. In this section we describe the persistence landscape corre- 
sponding to a finite persistence diagram and a finite barcode. 

A persistence diagrarr^ [2j is a multiset of pairs of real numbers (a, b) 
with a < b. A barcode [10J is a multiset of intervals. Here we consider a 



In [2] this is called a reduced persistence diagram. 
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Figure 1. The graph of /(i,5). 



finite persistence diagram, {(a^, with < 6^, or a finite barcode, 

{Ii}i=i- I n the second case we use the endpoints < bi of interval 

First we give an informal description of the persistence landscape 
of a persistence diagram. For each i e {1, . . . , n}, draw an isosceles 
triangle with base the interval [a^ bj\ on the horizontal t-axis, and sides 
with slope 1 and —1. So the triangle has vertices (a^,0), (6^,0) and 



( 



a>i+bi bj-aj 



). This subdivides the plane into a number of polygonal 



regions. Label each of these regions by the number of triangles con- 
taining it. For fcGN, let be the union of the polygonal regions with 
values at least k. Let A& : R — >■ R be the function whose graph is the 
upper contour of with A&(a) = if the vertical line t — a does not 
intersect 

Next we give a more formal description. Given (a, b) with a < 6, 
define / (a>6) : R R by 



(2) 



f(a,b)(t) = mm(t - a, b - £)_ 



then 



where c + denotes max(c, 0). Note that if m — ^ and h = - 

f(a,b)(t) = (h- \t-m\)+. 

That is, f{a,b) i s the piecewise linear bump function with height h and 
bandwidth /i, centered at m. See Figure [T] for an example. 

Now define the persistence landscape of {(a^, bi)}^ =1 to be the set of 
functions A& : R —> R, k e N, given by 

Ajfe(t) = fcth largest value of {/(a^WKU; 
with Ajfe(t) = if fc > n. See Figure [2] for an example. 
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Figure 2. The graph of the persistence landscape of 
{(0,10), (5, 8), (6, 14)}. 

Finally we give some motivation for this construction. We may think 
of the process of replacing a barcode with a persistence landscape as 
replacing each interval in a barcode with the set of all of its subintervals. 
To make this more precise, let us change the coordinates from (a, b) to 
(ra, h), where m = ^ and h = We are changing from birth - 

death coordinates to mid-lifetime (mid-life) - half time alive (half-life) 
coordinates. The union of all points (mf h!) corresponding to intervals 
V C I is the isosceles triangle with vertices (a, 0), (m,h) and (6,0). 
Then 

Afc(t) = sup{/i | the interval centered at t with radius h is 

contained in k intervals in the barcode). 

2.2. Diagrams of vector spaces indexed by (R, <). Here we review 
a framework for persistent homology developed in [1] . 

Let M be a diagram of finite dimensional vector spaces indexed by 
(R, <). That is, for each a G R, M(a) is a finite dimensional vector 
space over some field, F, and for all a < 6, we have a linear map 
M(a) — M(6), denoted M(a < 6), satisfying M(a < a) = Id M ( a ) for 
all a G R, and for all a < 6 < c, M(a < c) = M(6 < c) o M(a < 6)0 
We write M G Vec( R '^). 

For p > 0, define the p-persistence of M(a) to the image of the map 
M(a < a+p). For a < 6, define the corresponding £>eiiz number of M, 
by 

/3 a ' 6 (M) = dim(im(M(a < b))). 



That is, M is a functor from the poset (R, <) to the category of finite- 
dimensional vector spaces. 
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For an interval I C R, we define xi £ Vec^ R '-^ by 

(3) X,(c) = ( F ' [ice / and Xl (c < d) = / M " ifC ' dG/ ' 

I 0, if c /, v 10, otherwise. 

For a multiset S = {^}£=i, where Ii is an interval, define xs £ Vec( R '-) 
by 

(4) xs(c) = 0?=iXi<(c), and < d) = eJ^Xi^c < d). 

Say that M G Vec^ has finite type if M = xs for some 5 = 
Write Vec[f'~^ for the set of all elements in Vec( R '-) that have finite 
type. In [lj, it is shown that such an isomorphism is unique up to 
reordering. Thus, 

Theorem 2.1 ([1, Corollary 4.9]). There is a bijection between finite 
barcodes and isomorphism classes of finite type diagrams, given by S = 

If M G Vec( R '-) has finite type, let I(M) be the inverse image of 

— 2 

this bijection. For an interval /, let x(I) = (inf (/), sup(/)) G R . For 
a multiset of intervals, S = let D(S) denote the multiset {x(Ii)}. 
Let D(M) denote D(I(M)). 

It is also shown in [lj, that the finite- type diagrams are exactly those 
with finitely many critical values. 

2.3. Persistence landscapes from diagrams. Let us start with the 
observation that for a fixed diagram M and fixed t G R, /3* -# ' t+# (M) is 
a decreasing function. That is, 

Lemma 2.2. Let M G Vec (R ^ } and t G R. Then for0<s< s' , 

Proof. Since t - s f < t - s < t + s < t + s', M(t - s' < t + s') = 

M(t + s<t + s')o M(t -s<t + s)o M(t -s' <t-s). It follows that 

fits', t+J ( M ) < pt-s,t+8 ( M ) n 

Definition 2.3. Let M G Vec( R '-). The persistence landscape of M is 
the sequence of functions {A&(M) : R — >> R}&eN given by 

X k (M)(t) = sup(s > | P*- 8 '** 8 ^) > k). 

Define A(M) : N x R -> R by A(M)(fc, t) = \ k (M)(t). 



In the next two examples, we show that in light of Theorem 2.1, this 



definition does generalize the definition in Section 2.1 



STATISTICAL TOPOLOGY USING PERSISTENCE LANDSCAPES 7 

Example 2.4. Let / be an interval with endpoints — oc<a<6<oc. 
We will calculate the persistence landscape of the diagram xi, defined 
in @). 

Since dim(x/(c)) < 1 for all cGR, for each k > 2, A&(xi) = 0. For 
the remaining calculation, 

(5) Ai(x/(t)) = sup(s > | dim(im(x/(* - s < t + s))) > 1). 

If t ^ I : then Xi(t) = and since Xi{t — s < t + s) = x(t < t + s) o 
x(t — s < t), it follows that the right hand side of ^ equals 0. If t G /, 
then 

Ai(xj)(*) = SU P(> > | t-s,t + s G J) 
= min(£ — a, 6 — t). 

Therefore, Ai(xi)(f) = min(t - a, 6 - t)+ = / (a>6) (t) (see @). □ 

Example 2.5. Let 5 = where l{ is an interval with endpoints 

— oc < di < bi < oc. Consider the diagram xs G Vec^ R '-\ defined in 
Q. Then, 

A fc (x5)(*) = sup( 5 > | ^-^(xs) > k) 

= sup^>0| J2^ t+S (xu)>k 

If k > n, then the right hand side is 0. Otherwise, 

Ajfe(xs)(*) = fcth largest element of sup(s > | /3 t_s ' t+s (x/J > 1) 
= fcth largest element of Ai(xjJ(£) 
= fcth largest element of /( ai A)(*)- 

□ 

Next we establish some basic properties of the persistence landscape. 

Lemma 2.6. Afc(M) : M — » K. satisfies the following properties: 

(1) A fc (M)(*) >0, 

(2) A fc (M)(*) > A fc+ i(M)(*), and 

(3) \ k (M)(t) is 1-Lipschitz. 



Proof. The first two properties in Lemma 2i) following immediately 
from Definition 2.3, It remains to prove that X k (M)(t) is 1-Lipschitz. 
That is, \X k (M)(t) - A fc (M)(s)| < \t - s\, for all s,teR. 

Let s,t e i. Without loss of generality, assume that Afc(M)(£) > 
X k (M)(s). UX k {M)(t) < \t-s\, then X k (M)(t)-X k (M)(s) < X k (M)(t) < 
\t — s\ and we are done. So assume that X k (M)(t) > \t — s\. 
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Let < e < X k (M)(t) -\t- s\. By Definition [Z3] and Lemma |Z2 

(g) pt-\ k (M)(t)+s,t+\ k (M)(t)-e > 

Since —\t — s\ < t — s < \t — s|, s — \t — s\ < t < s + \t — s\. Also, 
-\ k (M)(t) + \t-s\+e < < X k (M)(t) -\t-s\-e. Combining these, 
we have, 

t - X k (M)(t) + £ < s - X k (M)(t) + \t-s\+£<s< 

<s + X k (M)(t) -\t-s\-£<t + X k (M)(t) - £. 

Together with ^ and Lemma |2.2[ we see that 

ps-\ k (M)(t)+\t-s\+e,s+\ k (M)(t)-\t-s\-e > ^ 

Therefore X k (M) (s) > X k (M)(t) -\t- s\. Thus X k (M)(t) - X k (M)(s) < 
\t-s\. □ 

To take advantage of the linear structure of the set of functions from 
N x K to K, we enlarge the set of persistence landscapes to its convex 
hull. 

Definition 2.7. Let PL be the convex hull of the set {A(M) | M G 
Vec( R '% 

The following characterization of functions in PL follows from the 
definitions. 

Lemma 2.8. The functions f : N x R — >> R in PL satisfy the following 
properties: 

(1) /(M)>o, 

(2) f(k,t) > f(k + l,t), and 

(3) f{k ) t) is 1 - Lip schitz for fixed k. □ 

To help visualize the graph of / : N x R — >■ R, we can extend it to a 
function / : R 2 —> R by setting 



(7) f(x,t) 



See Figure [3] for an example. 




if x > 0, 
if x < 0. 



2.4. Norms and metrics for persistence landscapes. Let (X, S, /i) 

be a measure space and let / : X —> R. Recall that for 1 < p < oo, 

\\f\\p = [J\f\ P M~ P ' and ll/lloo = esssup/ = inf{a | fi{s £ X \ f(s) > 
a} = 0}. For 1 < p < oo, = {/ : X R | < oo}. The 
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Figure 3. A 3D version of the graph of the persistence 
landscape in Figure [2j 

seminorm || — || p on C P (X) induces a pseudometric on C P {X) given by 

d(f,g) = \\f-g\\ P . 

On N x R, we use the measure that is the product of the counting 
measure on N and the Lebesgue measure on R. For / : N x R 4 1, 



k=i 



where fk(t) = /(/c, t). If we extend / to / : 



as in 0, we have 



p, where we use the Lebesgue measure on M 2 for the latter. 



The following corollary follows from Property (3) of Lemma 2.8 
which implies that f{k ) t) is continuous for all k. 

Corollary 2.9. Let f G PL. If \\f\\ p = then f = 0. □ 

For 1 < p < oc, define 

PL P = PLn/7(N x R). 



By Corollary 2.9, the pseudometric on || — || p induced by the semi- 
p is a metric on PL P . We will consider PL P as a metric space 

3. Persistence landscapes and metrics 



norm | 

with this metric, g) 



In this section we relate the metric induced by the L p norm to the 
Wasserstein distance. 
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For 1 < p < oo, the metric induced by || — \\ p on PI/ induces an (ex- 



2.3 



tended) pseudometric on Vec^ R '-^ via the function A in Definition 
Call this the p-persistence landscape distance, denote d p . That is, for 
Mi,M 2 e Vec^, 

d p (M u M 2 ) = \\\(M 1 )-\{M 2 )\\ p . 

For 1 < p < oo, Cohen-Steiner, Edelsbrunner, Harer and Mileyko 
define a metric W p , called the pth Wasserstein distance on the set 
of finite persistence diagrams [3j. For p = oo, this is same as the 



bottleneck distance defined in [2j. Here, we use Theorem 2.1 to define 
it for finite type diagrams in Vec^. 

— 2 

First, we need some notation. For (a, b) G R , let ||(a, 6)||oo — 
max(|a|, \b\). For a finite type persistence diagram M, let D(M) de- 



note the union of the multiset D(M) (see Section |Z2 ) together with 
the multiset consisting of the points in A := {(a, a) | a G K} each with 
infinite multiplicity. 

Definition 3.1. Assume M,M f are finite type diagrams in 
For 1 < p < oo, let 

W P {M,M') P = min \\x - f{x)\\^ and let 

Woo (M,M 7 )= min max \\x - f(x)\\oo. 

/:D(M)^D(M0 XGdOm(/) 

Similarly, we can define W p and Woo fc> r finite persistence diagrams. 

The main result of this section is the following. It will follow directly 
from Corollaries 13.121 and 13.141 

Theorem 3.2. The metrics d p and W p+ i are topologically equivalent 
on finite persistence diagrams. Equivalently, the pseudometrics d p and 
W p +i are topologically equivalent on finite-type (R, <)-indexed diagrams 
of finite- dimensional vector spaces. 

As a warm up for the proof of this theorem, we prove the following 
two easier equalities. 

Theorem 3.3. Let M e Vec (R '-) have finite type. Then for 1 < p < 



oo, 



\\x(M)r p = -^w p+1 (M,$y +1 . 



STATISTICAL TOPOLOGY USING PERSISTENCE LANDSCAPES 11 

The right hand side in the above theorem can also be written in 
terms of total p ^-persistence , defined in [3]. For 1 < p < oo, let 

(8) pers p (M) = ^ (sup(7) - inf (J))*. 

/ei(M) 

We remark that 



W p (M,Q) p =—per8 p (M). 



1 

So the conclusion of Theorem |3.3| can also be written as, 
l|A(M)||^ = ^^pers p+1 (M). 

Proof. Assume M = xs-> where S = {1^2=1 an d h is an interval. First, 

HA(x/)iii; = n//iii; = 2 jf 2 ^* = ^n;(-^J =— 1 w P+1 (x I ,®). 

Next, by appropriately decomposing the domain of integration, 

n „ 

\\KM)\\ P P = Y^U(xu)\\ p P = -pylWJMr 1 . □ 

?'=1 

Theorem 3.4. Let M G Vec^'-) /iowe finite type. Then 

II A(ikf) ||oo = Woo(M 5 0). 

Proof. Assume M = X s, where 5 1 = {/j}f =1 . For 1 < i < n, let 
(^,60 = (inf(/i),sup(7;)) and ^ = Then ||/ (ai , 6i) |U = ft*. It 

follows that || A(iW) ||oo = max!<;< n h { = W^M, 0). □ 

We now establish a number of preliminary results that we will need 
in order to prove Theorem |3.2[ which will finally follow directly from 
Corollaries E32] and EH 

Proposition 3.5. Let M,M' £ Vecf'- } . Then, 

||A(M)-A(M')||oo<^ 00 (M,M / ). 

To prove this proposition, we will use the following two lemmas. 

Lemma 3.6. (1) Let I be a finite interval. Let x = (inf /, sup/) G 
R 2 , andx' = ( infJ+ 2 supJ , infJ t supJ )- Then ||A( X /)||oo = Wx-x'lU 
(2) Let I and V be finite intervals. Let x = (inf /, sup /) and x' = 
(inf/', sup/'). Then \\X( Xl ) - X( X i')\\oo < \\x - x'\ 



OO * 



Proof (1) Recall that A( X/ )(l,t) = /*(*). So \\X( X i)\\oo = \\f x \\oo = 

sup I— inf I 



2 1 1 % % 1 1 oo • 
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(2) ||A(M) - A(M')||oo = sup t |sup(s >0\a<t-s<t + s< 
b)-sup{s > | a' < t-s < t+s < b')\ < max(|a-o'|, \b-b'\) = 

1 1 X X 1 1 cx) • I— I 

Given a sequence ci, . . . , c n , let C(i), . . . , C( n ) denote a reordering of 
the sequence so that < • • • < C( n ). 

Lemma 3.7. If |q — di\ < 5 for i — 1, . . . n, then \c^) — d^)\ < S for 
k = 1, . . . , n. 

Proof Let k G {1, . . . , n). Since < • • • < c^), using the assumption 
k times, we see that 

#{1 < i < n | di < C(jfe) + 5} > k. 

Similarly, 

#{1 < i < n | a < + 5} > fc. 

It follows that d(k) < c^)+5 and c^) < d^)+5. Therefore \c^) — d^)\ < 
5. ^ ^ , ^ ^ 



Proof of Proposition \375} Assume M = xs, where 5 is the multiset 
{/*}?=!. Let 5 = Woo (M,M 7 ). Let * G R. Then {A(M)(fc, t)}^ = 
{A(x/J(l, *)}? =1 U{0} and similarly for A(M')(fc, £). By the definition of 
Woo and Lemma [3^6| there exists a bijection between {A(M)(/c, t)} and 
{A(M / )(/c, £)} such that the absolute value of the difference between 



corresponding elements is less than or equal to S. By Lemma 3.7 



|A(M)(M) " A(M')(M)| < 5 for all k. □ 
Lemma 3.8. Lei M, M' G Vecf '- } and 1 < p < oo. 7%en 

Woo(M,M') < W P (M,M'). 

Proo/. Fix / : D(M) = D{M f ). Let A = [£, Gdom(/) ||:r " /(*)!&>] ? - 

Let x G dom(/). Then A > [||x - = ||x - /(x)^. Therefore 

sup :rGdom (j> ) ||x — /(x)||co < A. The result follows. □ 

Lemma 3.9. Let M G Vec[f'- } . Then, 

fi ({(fc, t) | A(M)(fc, t) > 0}) = 2Wi(M, 0). 

Proo/. 2Wx(M,0) = 2^, = £ J€IW (sup I - inf J). The 

result follows from rearranging the domain of A(M). □ 

Proposition 3.10. Assume that M,M' £ Vec^ R '-^ have finite type. 
Then 

\\X(M) - X(M')\\ P < 2W 1 (M,<b)W 00 {M,M'y + -^W P+1 (M, M'f +1 ■ 
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Proof. Let / : D(M) D(M') be the bijection attaining the minimum 
in the definition of W P+1 (M, M r ). Now, 



||A(M)-A(M')||* = J (A(M) - X(M f )) p dfi 

-I 



(A(M) - A(M')f dfi 



\(M)(k,t)>0 



X(M')(k,t)>0 
\(M)(k,t)=0 



(X(M')) p dfjL 



Combining Proposition |3.5| and Lemma |3.9[ we see that the left hand 
term is bounded by 2W 1 {M, 0)VL oo (M, M'f. 

The right hand term is bounded by Execod(/)-A 2 J^ x ~ f ~ 1(x)lloc t p dt < 

E, edom (/) jfcW* ~ /(*)ll P oo +1 = ^W p+1 (M,Mr +1 . □ 

Corollary 3.11. Assume that M,M' G Vec^ R '-^ have finite type and 
that W p+1 (M, M') < W x (M, 0) . Then 



(9) ||A(M)-A(M')|| P < 2^(M,0) 



1 + 



p + 1 



W P+1 (M, M'). 



Proof. By Lemma |3.8[ the left hand term on the right side of the in- 
equality in Proposition |3.10| is bounded by 2W 1 (M, ®)W P+1 (M, M') p 
We thus have, 

2 

V + 



\\{M) - \(M')\\l < 2W 1 (M^)W p+1 (M,M'f + —^W p+1 (M,M'Y +l 

1 



< 2Wi(M,0) 



1 + 



p+1 



W p+1 (M,M'y. □ 



Corollary 3.12. Let M e VeCff'"- 1 . Let r > 0. Then there exists r' 
such that B r i{M; W p+1 ) C B r (M; d p ). 

Proof. Let k M , P denote the coefficient of W P+ ±(M, M') in Let r' = 
min(Wi(M, 0), j^-). Choose M' e VecJ'- } such that W P+1 (M, M') < 
r'. Then by Corollary |3.11 , 

||A(M) - A(M')|| P < k MtP W p+1 (M, M') < k M , p r' <r. □ 



Proposition 3.13. Let M, M' E Vec^'- } . // ||A(M) - X(M' 
sufficiently small, then 



is 



\\X(M)-\{M')\\l> 



p + 3 
(p + l)2f+ 1 



W P+1 (M,M'Y +1 . 
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Proof. Consider a < 6, a r < b r with e = ||(a, 6) — (a',?/)!!^. Assume 
that e « b — a. Recall that (m,h) = ^) and similarly for 

(m',ti). So ||(m,fr) - (m',/i')||i =e. 

We are interested in f( a ^ and f( a f ,b f )- Let £>, be the radius £ box in the 
1-norm centered at (to, /i). Then (to 7 , h') lies on the boundary of this 
box. Consider the intersection of B and the symmetric difference of the 
regions under the graphs f( a ^ and f( a f ,b f )- This region corresponds to 
part of the area between the curves f( a ^ and f( a f ,b f )- A minimizer for the 
contribution of this region to the integral \\f( a ,b) — /(a' ,6') lip * s ^ e 
(to 7 , ti) = (m + §£,/&+ From this we calculate, ||/ (aj6) - f( a ',v)\\p > 

(p+l)2P+i fc 

Let M G Vec[f'- } , where M = and 5 = {ii}? =1 . For 1 < % < n, 
let = (inf^sup/,). Choose M' e Vec[J'- } such that ||A(M) - 
A(M / )|| p is sufficiently small. Let g be the minimizer in the definition 
of W p+1 {M,M f ). For 1 < i < n, let e { = \\x - ^(x)^. Then ||A(M) - 

□ 



we 



Following the same argument as in the proof of Corollary |3.12 
have the following. 

Corollary 3.14. Let M 6 Vec^f . Let r > 0. Then there exists r' 
such that B r ,(M;d p ) C 5 r (M; V^+i). □ 

4. Stability of persistence landscapes 



In this section, we combine Proposition |3.10| and the stability theo- 
rems of [2] and [3] to get a stability theorem for persistence landscapes. 

Let X be a triangulable compact metric space. For a function / : 
X —> R, let H*(Xf,F) denote the (R, <) indexed diagram of vector 
spaces given by iJ*(X/,F)(a) = i7*(/ _1 ((oc, a]), F), for all aGK, and 
H*(Xf,W)(a < b) is the map induced on homology by the inclusion 
f-\(oo,a}) C f-\(-oc,b}). We say that / is tame if H*(X f ,W) e 
Vec[f'~\ We abuse notation by denoting H*(Xf,W) by just /. 

We assume that for some k > 1, there exists a constant Cx such that 
for all tame Lipschitz functions / : X —> R with Lipschitz constant, 
Lip(/), at most 1, pers k (f) < C x (see M). 



Theorem 4.1 (Landscape stability theorem). Let /, g : X — R 6e iu>o 
iame Lipschitz functions. Then 

d p (f, gf < P er Sl (/)||/ - ^ + -^-rCU/ - 
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where C = C x max(Lip(/) fe , Lip(g) fc ). 

Proof. The theorem follows by applying the Main Theorem of [2] and 

the Wasserstein Stability Theorem of [3j to Proposition 3.10, □ 



5. Statistical inference with persistence landscapes 

In this section we prove a number of results useful for conducting 
statistical inference for persistent homology. We show that the spaces 
PI/ are complete and separable. For p = 2, we prove that the Frechet 
mean and variance and can be calculated pointwise and we give corre- 
sponding limit theorems. 

5.1. Polish space. A good setting for probability theory is a metric 
space that is complete and separable. A metric allows one to consider 
convergence. Completeness allows one to use Cauchy sequences, and 
separability avoids certain measure-theoretic difficulties. 

Recall that PI/ = PLn/?(N xR) = PLnZ/(N x K). 

Proposition 5.1. Let 1 < p < oc. Then PI/ is separable. 

Proof. For 1 < p < oc, L P (N x 1) is a separable metric space. Since a 
subspace of a separable metric space is separable, the result follows. □ 

Proposition 5.2. Let 1 < p < oc. Then PI/ is complete. 

Proof. Let {/i}iGN be a Cauchy sequence in PI/. Since {fi} is a Cauchy 
sequence in L P (N xl), it is well known fact that {fi} converges point- 
wise a.e. and in L P (N x R) to a function / e L P (N x M). In our case, 
each fi is continuous, so in fact {fi} converges to / pointwise. Since 
/ is the pointwise limit of 1-Lipschitz functions, it is 1-Lipschitz. It 
follows that / G PI/. □ 

Combining the two propositions above, we have the following. 

Theorem 5.3. For 1 < p < oo, PI/ is a complete and separable metric 
space, i.e., it is a Polish space. □ 

5.2. Preliminaries. Here we prove some results that we will be useful 
later on. In this section, we assume that 1 < p < oc. 

Lemma 5.4. Let f and g be 1-Lipschitz functions. Let t G R. Then 

\\f-9\\ p P >^\m-9(t)\ p+i . 
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Proof. Let a = \ f(t) — g(t)\. Since \f — g\ is 2-Lipschitz, 

11/ -g\\ p P = f 1/00 - 9(s)\ p ds > |/oo - rf s 



Corollary 5.5. Lei / andg 6e 1 - Lip schitz functions. Lett G R. T/ien 

i/(t)-^)i<2||/-^ir T . 



Proof. By Lemma 



5.4 



|/(*) -<?(*)! < (p+^^jlZ-^lir 1 - ^ is an 
easy calculus exercises to check that for x > 0, x* has a maximum at 
x = e, and < 2. □ 

Let e N x R. Define ev (M) : PL R by ev (M) (/) = /(M)« 

Lemma 5.6. Lei 1 < p < oc and Zef G N x R. Then the map 

ev(k,t) : PL P — >• R is continuous. 

Proof. Choose e > 0. Set S = (-L-e^Y . Let f,g G PL P such that 



11/ ~~ fl'llp < ^- By Lemma 5.4 



11/ - <?ll£ = En/* - ^Hp ^ ii/* - ^Hp ^ ^Tll^' *) - g ^ ^ p+1 

i=l 



Therefore \f(k ) t) — g{h,t)\ < ((p+ = e. Thus ev(k,t) is con_ 

tinuous. □ 

Define £:NxRxPL^Rby E(k, t, f) = /(fc, *) . 

Lemma 5.7. T/ie map continuous. 

Proof. Choose e > 0. Let 5 = min (l, (§) ^ . Assume (Jfc, t, /), /') G 

N x R x PL P such that max(|fc - fc'|, |t - f|, ||/ - f\\ p ) < 8. Since 
5 < 1, this implies that k = k'. Using the fact that is 1-Lipschitz 
and Corollary |5.5[ 



|/(M) - /'(MO I < l/(M) - /(MO I + - 

< |* - t'\ + 2||/ - fwf 1 <S + 2S^r < 3S^ = e. 

Therefore E is continuous. □ 

Since E is continuous, E is measurable. This will allow us to apply 
Fubini's theorem. 
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5.3. Frechet mean and variance. A general notion of mean that 
can be defined in any metric space is the Frechet mean, which we now 
define. 

Let V be a probability measure on (PI/,£>) where B = £>(PL P ) is 
the a- algebra of Borel sets in PL P . 

Definition 5.8. Define the Frechet function, F v : PI/ — >■ R by 

Mf)= [ \\f-g\\ 2 p V(dg). 

Define the Frechet variance of V by 

Var P = inf F v (f). 

Define the Frechet mean set of V , to be the set {/ G PI/ | Fp(f) = 
Var^}. If the Frechet mean set contains a single element, we refer to 
that element as the Frechet mean of V. 

Example 5.9. If V is a probability measure on R and X ~ V, then 
the Frechet mean of V is E[X], and the Frechet variance of P is Var[X]. 



Let 1 < p < oc. Let G N x R. By Lemma 5.6, ev(k,t) : 

(PL P , B, V) -> (R, B(R)) is a random variable. So P (M) := (ev (M) )*P = 
P is a probability measure on R. 

Definition 5.10. Define ft : N x R -> R by ft(M) = where 
X(k,t) ~ P(jfe,t)- That is, ft(M) = / R s V(k,t)(ds) = J FLP f{k,t) V(df). 
Lemma 5.11. ft G PL. 

Proof. We s how that ft G PL by checking the three conditions in 
Let (k,t) G N x R. Notice that h(k,t) = J sV {k ,t)(ds). 



Lemma 



2.8 



For (1), p (M) (-oo,0) = p{/ G PL^ | /(fc,t) < 0} = 0. Thus 
h(k,t) = / °° sV m (ds) > 0. For (2), let k < t G N. Then 

ft(fc, f) = / f(k, t)T(df) > [ f(i, t)T(df) = h(i, t). 

JPL P JPLP 

For (3), \h(k,t) - h(k,s)\ = J PLP \f(k,t) - f(k,s)\V(df) < J pLP \t - 
s\V(df) = \t-s\. □ 

Lemma 5.12. // / plp ||/||p dV < oo, then h e C P (N x R). 



Proof. Using Fubini's theorem, which we can apply thanks to Lemma [577 

i p - ' h(k,ty dn= I [ f(k,ty dv dn 



V 



NxM 



/ / f(k,ty dixdv= [ \\f\\ p p dv<oo. □ 

JPLP JNxR Jplp 
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Remark 5. 13. S ince is 1-Lipschitz by Lemma 5.11| from the proof 
of Lemma 5.12, we see that if / PLP ||/||p dV < oo, then h{k,t) < oc for 
all k. t. 



Lemma 5.14. Assume p=2. Then F-p(f) = J NxR F V(kt) (f(k,t)) dji. 
Proof. By Fubini's Theorem, which we can apply using Lemma [5. 7\ 



Mf) 



(/(M) 
(/(M) 

NxR JPL 2 

(/(M) 

Nxl 



-g(k,t)f d//P(<fc) 
-g(k,t)) 2 V(dg)dfi 
yf V(k,t)(dy)d/j, 



J 

JNxR 



F V(kt) (f(k,t)) d/i. □ 



Theorem 5.15. Assume p = 2 and Jj 



PL? 5 



p rfP < oc. 



Frechet mean ofV is h and the Frechet variance ofV is J NxR Var<p (fc 



Proof. By Example 5.9, P^t) has Frechet mean h(k,t). Applying this 
to Lemma [5.14[ we get that Fj> has Frech et mean h. Furthermore, again 



combining Lemma 



5.9 



and Lemma 



5.14 



F v (h) 



/NxM Var nfe,t) 



dji. □ 



5.4. Limit theorems. In this section we prove some pointwise limit 
theorems for persistence landscapes. 

Let V be a probability measure on the sp ace P L 2 = PLn£ 2 (M 2 ), 
with J pLP | 
mean of V. 

Let Ai, . . . , X n G PL 2 be an iid sample from V . Let V 



v v dV < oc. Then by Theorem 



5.15 



h-p is the Frechet 



5.15 



denote the empirical measure. Then by Defini tion |5.10[ for all G 
NxR, h(k,t) = ^Z)*Li Ai(fc,t). So by Theorem 
A n is the Frechet mean of V . By Theorem 
of V is given by Var^ = ±||A n - A ?; 



5.15 



the Frechet variance 



Now fix (k,t) G N x 



12 

'H||2- 

For 1 < i < n, let X 



A,(fc,t) 



5.13 



,et X n := 
H < oc. 



n ELi ^ = ^n(^j*) an d f 1 := h-p(x,t). By Remark 
Also, E[Xj] = /x. Recall that ev(^)(A) = A(fc,t) and 
Thus Xi , . . . , X n is an iid sample of integrable random variables from 
V(k,t) with E[Xi] = \i < oc. So we can apply the Strong Law of Large 
Numbers. 

Theorem 5.16 (Strong Law of Large Numbers). As n —> oc ; 



X„ 



-> 11. 
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That is, Pr (lim n ^ 00 X n — fij — 1. □ 

Corollary 5.17 (Pointwise convergence of the sample mean to the 
Frechet mean). For all (k,t) G N x R, 

\ n (k,t) h v (k,t). 

□ 

Now assume that X { has finite variance. Let a 2 = Var[JQ]. Then 
we have the following from the Central Limit Theorem. Let AT(/x, a 2 ) 
denote the normal distribution with mean /i and variance a 2 . Let <£> 
denote the standard normal cumulative distribution function, <&(x) = 



Theorem 5.18 (Central Limit Theorem). yls n — > 00, 

TTiai is, Vr G K, lim^oo Pr (y/n(X n - y) < r) = $ (£) . □ 

Corollary 5.19 (Pointwise CLT for the sample mean). For a// (fc, £) G 
NxR, 

(K(k, t) - hp(k, t)) A N(0, a 2 ). 

□ 

Recall that the Frechet variance of the empirical measure is given by 
Var^ = iE^ill^-^ll2 = ^/(^(A:,t)-A n (A;,t)) 2 dfi. So, 



Var^ = J s 2 n (k,t) d/n. 



where s^(fc, i) = ^ X^ILi (A^(/c, £) — A n (fc, i)) 2 is the biased sample vari- 
ance. Using s 2 {k ) t) = Y^i=i {^i{k"> t) ~ A n (^ i)) 2 , the unbiased 
sample variance, we have 



Var^ = — / s 2 (k ) t) da. 

n J 



5.5. Interpretation of the average persistence landscape. For 

a sample Ai,...,A n of persistence landscapes, we call the empirical 
Frechet mean, A n , the average persistence landscape. Consider barcodes 
2?i, . . . , B n and the corresponding persistence landscapes Ai, . . . , \ n . 
For k G N and t G R, A n (fc, t) is the average value of the largest radius 
interval centered at t that is contained in k intervals in the barcodes 
Si, ... , B n . 
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6. Examples 

6.1. Persistence landscape of a torus. Consider an iid sample of 
1000 points from a torus embedded in M 3 using the uniform surface 
area measure 




For these points, we construct a filtered simplicial complex as fol- 
lows. First we triangulate the underlying space using the Coxeter- 
Freudenthal-Kuhn triangulation. Next we smooth our data using a 
triangular kernel. We evaluate this kernel density estimator at the ver- 
tices of our simplicial complex. Finally, we filter our simplicial complex 
by taking the flag complex on lower excursion sets of the vertices. That 
is, for filtration level r, we include a simplex in our triangulation if and 
only if the kernel density estimator has values less than or equal to r 
at all of its vertices. Here are three stages in the filtration. 




We then calculate the persistent homology of this filtered simplicial 
complex using, say, javaPlex [8j. For the persistent homology in di- 
mension 1, here is the corresponding persistence landscape, in both 2d 
and 3d versions. 
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6.2. Average persistence landscape of a torus. Now if we repeat 
this 10 times, we have a sample Xi, X 2 , . . . , Xi of ten persistence land- 
scapes. 






We calculate their Frechet mean by averaging pointwise. 




We remark that this average persistence landscape is not the per- 
sistence landscape corresponding to a barcode or persistence diagram. 
However, there is a direct interpretation of the average persistence 



landscape in terms of barcodes (see Section 5.5). 
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We can get a sense of the stability of the features of this average 
persistence landscape by subtracting (on the left) and adding (on the 
right) 2 standard deviations pointwise. 




6.3. Comparing landscapes for a torus and a sphere. Now let 

us apply statistical inference to persistence landscapes obtained from 
a torus and sphere with the same surface area. 




For k = 0, 1, 2, let X\ , . . . , X^ Q be an iid sample of persistence 
landscapes in dimension k obtained from 1000 iid points on a torus 
sampled using the uniform surface area measure. Let Y^, . . . , Y± 
be the corresponding sample of persistence landscapes from a sphere. 
Here are the average persistence landscapes. In rows 1, 2 and 3, we 
have the average persistence landscape in dimension 0, 1 and 2 of the 
torus on the left and the sphere on the right. 
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In dimension 0, the rf 2 distance between the average landscapes of 
the torus and sphere is 48.8 and the square root of the Frechet variances 
of the torus and sphere and 123.4 and 143.8, respectively. In dimension 
1, we obtain a distance of 370.0 and square root of Frechet variances of 
69.6 and 57.6. In dimension 2, these numbers are 26.6, 23.7, and 24.8. 

We use the permutation test with 10,000 repetitions to determine if 
the difference between the two samples is statistically significant. Com- 
paring the two samples for dimension 0, the difference is not significant 
(p value 0.6106). In dimension 1, the difference is significant with a 
p value of 0.0000. Restricting to each of k — 1 and k = 2 we have p 
values of 0.0000 and 0.0217, respectively. Restricting to k > 2, there 
is no significant difference (p value 0.6570). In dimension 2, we get a 
p value of 0.0088. However if we restrict to k — 1 the difference is not 
significant (p value 0.9997). 
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6.4. Comparing landscapes from noisy samples. We now repeat 
the analysis of the previous section with the addition of Gaussian noise 
to the point samples. On the left we have 1000 points sampled from a 
torus, from the perspective that makes it easiest to see the hole in the 
middle. On the right we have points sampled from the sphere. 




We calculate persistent homology in the same way as in Section [O 
Here are the average persistence landscapes. In rows 1, 2 and 3, we have 
the average persistence landscape in dimension 0, 1 and 2, respectively 
with the torus on the left and the sphere on the right. 
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Again we calculate the d 2 distances between the average landscapes 
and the square root of the Frechet variances from the torus and the 
sphere. In dimension 0, they are 69.7, 67.3 and 78.1. In dimension 1, 
we have 59.5, 23.5 and 26.4. In dimension 2, the values are 44.5, 7.2 
and 10.2. 

Again, we use the permutation test to determine if the difference 
between the two samples is statistically significant. 

There is a significant difference in dimension 0, with a p value of 
0.0111. This is surprising, since the average landscapes look very sim- 
ilar. However, on closer inspection, they are shifted slightly. Here is a 
graph of the difference. 




Note that we are detecting a geometric difference, not a topological one. 
Less surprisingly, there is also a significant difference in dimensions 1 
and 2, with p values of 0.0000 and 0.0000, respectively. 
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